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Abstract 

We answer a problem posed in [GKM08] regarding a restricted model of small-space computation, 
tailored for solving the GEN problem. They define two variants of "incremental branching programs", 
the syntactic variant defined by a restriction on the graph-theoretic paths in the program, and the more- 
general semantic variant in which the same restriction is enforced only on the consistent paths - those 
that are followed by at least one input. They show that exponential size is required for the syntactic 
variant, but leave open the problem of superpolynomial lower bounds for the semantic variant. Here we 
give an ex ponential lower bound for the semantic variant by generalizing lower bound arguments from 
[BCM + 09] |WehlO| for a similar restricted model tailored for solving a special case of GEN called Tree 
Evaluation. 
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1 How to read this paper 



The introduction [2] (which is short and should be read entirely) defers several definitions to Section 13.11 
the reader should refer there to read any unfamiliar definitions as they arise. The proof of our main result 
spans Sections 13.21 14.11 14.31 All but the most-casual readers should read Section 13.21 which sets up and 
outlines the proof. Sections [3.31 and B~2l can safely be skipped by readers only interested in our main result. 

2 Introduction 

An instance T of m-GEN is just a function from [to] x [to] to [to], where [to] = {1, . . . , to}. T is a YES 
instance iff to is in the closure of the set {1} under the operation T. Depending on the computation model, 
T is given as m 2 [log to] bits or, more naturally, as to 2 elements of [to]. The computation model we use, 
to- way branching programs (defn [1} , is the standard one for studying the nonuniform space complexity of 
problems represented in the second way. 

We refer to the to 2 , [m]-valucd input variables that define the to-GEN instances by the names {(x, y)}x,ye[rr, 
and we refer to input T's value of variable (x, y) by T(x, y). Throughout, we only talk about deterministic 
branching programs. A deterministic semantic-incremental branching program (BP) solving to-GEN 
is an TO-way BP B that computes to-GEN such that for every state q of B that queries a variable (x, y) and 
every input T that visits q, for both z S {x, y] either z = 1 or there is an earlier edge on the computation 
path of T labeled z. The main goal is Corollary [T] in Section 1431 

There is a constant c > such that for infinitely-many to every deterministic semantic-incremental 
BP solving to-GEN has at least 2 cra / logm states. 

3 Preliminaries / Outline 

3.1 Definitions 

Definition 1 (fc-way branching program). A deterministic k-way branching program B computing a function 
g : [fc] Vars \ — > Out, where Vars and Out are finite sets, is first of all a directed multi-graph whose nodes are 
called states, having a unique in-degree state called the start state. Every state is labeled with an input 
variable (an element of Vars) except for \Out\ output states with out-degree labelled with distinct output 
values (the elements of Out). Every state has k out-edges, labeled with distinct elements of [k]. An input 
I (a mapping X i— > X 1 from Vars to [k]) defines a computation path from the start state through B in the 
obvious way: from a non-output state q labeled with X G Vars, I follows the edge out of q labeled X 1 . 
The computation path of / must be finite, ending at the output state labeled with g(I). The size of B is its 
number of states. We say that B solves a decision problem if | Out\ = 2. 

Definition 2 (rooted dag, root u roo t, leaf, child, parent, arc, size). A rooted dag G is a directed acyclic 
graph with a unique out-degree node called the root, denoted u roo t- In-degree nodes are called leaves. 
We refer to the edges of G as arcs in order to avoid confusion with the edges of a branching program. The 
nodes with arcs into u are the children of u and the nodes that receive arcs coming out of u are the parents 
of u. 

Definition 3 (Dag Evaluation Problem). An input is a 4-tuple (G, k, I, f). G is a connected rooted dag and 
k > 2 is an integer. I consists of a [log fc] -bit string specifying a value in [k] for each leaf node of G, and 
/ consists of a fc d [log fc~|-bit string specifying a function from [k] d to [k] for each non-leaf node of G with d 
children. Each non-leaf receives a value in the expected way; namely, by applying its function to the values 
of its children. The function version of the problem asks for the value of the root. The decision version asks 
if the root value is 1. 
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The next definition subsumes the previous one; it makes precise how inputs to the Dag Evaluation 
Problem are presented to fc-way BPs, and introduces notation that we will use throughout this paper. The 
variable G denotes a connected rooted dag (sec definition [5]) with at least two nodes throughout this paper. 

Definition 4 (DEq : Dag Evaluation Problem for fixed dag G). The size of an input to DEq is determined 
by a parameter k > 2, and we write DEc(fc) for the problem restricted to inputs with size parameter k. The 
[fc]-valued input variables Vars of DEc(fe) are as-follows: 

l u for each leaf u € G 
fuifl) for each node u £ G of in-degree d > 1 and each a € [k] d 

We write 1^ and fu{a) for the value input I: Vars —> [k] assigns to variables l u and /«((?)• For u £ G we 
define u 1 , the value of I on u, inductively: if u is a leaf then u 1 = 1^, and if u has children vi, . . . ,Vd then 
u 1 = fu(v{, ■ ■ ■ ,Vj). DEc(fc) is a decision problem; the output is YES if uJ oot = 1 and NO otherwise. 

We generalize the definition from |BCM + 09] [WehlOj of deterministic thrifty BPs solving the Tree Eval- 
uation Problem (which is the Dag Evaluation Problem for the complete binary trees): 

Definition 5. A k-w&y BP solving DEc(fc) is thrifty if for every state q that queries an internal node variable 
/ u (ai, . . . , arf), if Vi, . . . , Vd are the children of u then every input I that visits q has v[ = a\, . . . ,vi = Qd- 

Definition 6 (Black pebbling cost of G). Let G be a rooted dag. A pebbling configuration C of G is 
given by a subset of the nodes of G which are said to be pebbled. A complete pebbling sequence for G 

is a sequence of pebbling configurations n = C\ , . . . , Cf such that every node is unpebbled in C\ , the root is 
pebbled in Ct, and for all £ £ {1, . . . , t* — 1}, configuration Ct+i is obtained from Ct by one of the following 
types of pebbling moves: 

1. If all the children of node u are pebble in Ct, then in Ct+i a pebble can placed on u and simultaneously 
zero or more of the children of u can have their pebbles removed. 

2. A pebble is removed from some nodc0 

We say tt requires p pebbles if p is the maximum over all Ct of the number of nodes pebbled in C't ■ Finally, 
the pebbling cost of G is the minimum number of pebbles required for a complete pebbling sequence for 
G. 

3.2 Outline of proof 

For arbitrary G and k, Theorem [1] gives lower bounds for thrifty BPs solving DEc(fc) in terms of k and 
the pebbling cost of G. Let T h be the complete binary tree with 2 h — 1 nodes. Theorem[T]is a generalization 
of the following result from [WehlOj . stated in terms of the notation introduced above: 

For any h, k > 2 every deterministic thrifty BP solving DE T h (k) has at least k h states. 

Theorem [3] uses Theorem Q] to get lower bounds for semantic-incremental BPs solving m-GEN in terms 
of the pebbling cost of dags with indegree 20 The bulk of that proof consists of showing that for any G 
with indegree 2, there is a polynomial-boundecH reduction g from DEq to GEN such that thrifty BPs can 
efficiently simulate semantic incremental BPs that solve instances of GEN from the range of 5 |3 Corollary Q] 
uses Theorem [3] for each member of a particular hard-to-pebble family of dags, whose existence was proved 
in |PTC76j . 

1 We don't actually need to include this as a possible move. 

2 This could be generalized to work for families of dags with unbounded indegree, but we have no use for that generalization 
here. 

3 And very efficiently computable, though we don't need that fact. 

4 More precisely, if E is the set of DEq(A;) instances, and there is a size s semantic incremental BP solving the set of GEN 
instances g(E), then there is a thrifty BP solving DEc(k) of size at most s. 
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3.3 Remarks on proofs by Gal, Koucky, McKenzie 



The authors of [GKM08 obtain exponential lower bounds for syntactic incremental BPs solving to-GEN 
in two ways. Both methods also work for a nondctcrministic variant of syntactic incremental BPs. First, 
they use the Raz/McKenzie lower bounds for monotone circuits RM99 to get a lower bound of 2™ for some 
e > and sufficiently large The first method works for a possibly-larger larger class of BPs, but the 
definition of that class is not simple@ Their second method uses a probabilistic argument^ combined with 
the same pebbling result that we use to get a lower bound of 2 c "/ log ™ for some c > and sufficiently large 
n. 



4 Results 

4.1 Lower Bound for Thrifty BPs 

Theorem 1. If G has pebbling cost p then for any k > 2 every thrifty deterministic BP solving DEc(k) has 
at least k p states^ 

Proof. Fix G, k and a deterministic thrifty BP B that solves DEc(fc). Let n be the number of nodes in G 
and Q the states of B. If it is a non-leaf node with d children then the u variables are /«(a) for each a £ [k] d , 
and if u is a leaf node then there is just one u variable l u . We sometimes say "/„ variable" just as an in-line 
reminder that u is a non-leaf node. When it is clear from the context that a state q is on the computation 
path of an input /, we just say "g queries u" instead of "q queries the thrifty u variable of 

We want to assign a black pebbling sequence to each input; to do this we need the following lemma. 

Lemma 1. For any input I and non-leaf node u, there is at least one state q on the computation path of 
I that queries and for every such q, for each child v of u there is a state on the computation path of I 
before q that queries v. 

Proof. Fix an input /. We prove the lemma for / starting with the root, and then the children of the root, 
and so on. Let v\, . . . , Vd be the children of w r0 ot- I must visit at least one state that queries its thrifty w roo t 
variable, since otherwise B would make a mistake on an input J that is identical to / except 



2 if<x>t = l 
f otherwise 



Now let u be any non-leaf node and q any state on the computation path of / that queries u. Suppose the 
lemma does not hold for this q, so for some child v of u there is no state before q that queries v. For every 
a ^ v 1 there is an input I a that is identical to / except v Ia = a. Now I a visits q since I and I a have the 
same computation path up to q; hence the thrifty assumption is violated. □ 

We define the pebbling sequence for each input / by following the computation path of I from beginning 
to end, associating the t-th state visited by / with the t-th pebbling configuration Ct, such that Ct+i is 
cither identical to Ct or follows from Ct by applying a valid pebbling move. Let qi,...,qp be the states on 
the computation path of / up to the state qt* immediately following the first state that queries the root; 
Cf will be the last configuration, and the only configuration where the root is pebbled. Note that q\ must 
query a leaf by Lemma [TJ We associate q± with the empty configuration C\ . 

Assume we have defined the configurations Ci, . . . , Ct associated with the first t < t* states, and assume 
Ci,...,C t is a valid sequence of configurations (where adjacent identical configurations are allowed), but 
neither it nor any prefix of it is a complete pebbling sequence. We also maintain that for all t' < t, if the 



6 See section 3.1 "Tight Computation of GEN" of [GKM08I . 
7 See Lemma 5.2 "Symmetrization Lemma" of IGKM081 . 

8 In section [3] we specified that G denotes a connected rooted DAG with at least two nodes. 
9 Recall that "queries u" means queries the thrifty u variable of I. 
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node queried by qy is not a leaf, then its children are pebbled in CV and it is not. Let u be the node queried 
by qt- By the I.H. u is not pebbled in Ct- We define Ct+i by saying how to obtain it by modifying Ct- 

1. If u is the root, then t + 1 = t* by the definition of qt* , and by the I.H. all the children of u are pebbled. 
Put a pebble on the root and remove the pebbles from its children. This completes the definition of 
the pebbling sequence for I. 

2. If u is not the root or a leaf, then by the I.H. all the children of u are pebbled. For each child v of u: 
if there is a state q' after q t that queries some parent of v, and no state between q t and q' that queries 
v, then leave the pebble on v, and otherwise remove it. 

3. If u is not the root, then place a pebble on it iff there is a state q' after q t that queries some parent of 
u and no state between q t and q' that queries u. 

Let p 1 be the maximum number of pebbled nodes over all the configurations we just defined. So p 1 > p 
since G has pebbling cost p. Let Ct be the earliest configuration with p 1 pebbled nodes. Later we will need 
that qt is not an output state, so we prove that now. 

It suffices to show t < t* , since then there must be at least one state q t +i (possibly an output state) after 
qt- We use the assumption that G is connected and has at least two nodes, so the root has degree d > 1. 
In the move from Ct*-i to Ct* one pebble is added and d pebbles are removed, so either Ct* has fewer 
than p 1 pebbled nodes (if d > 1) or else Ct*-i is an earlier configuration with p 1 pebbled nodes. Hence 
t < t*. 

Define the critical state r 1 for I to be qt- We refer to the nodes pebbled in Ct as the bottleneck nodes 
of /. The following fact is immediate from the pebbling sequence assignment. 

Fact 1. For any input I, if non-root node u has a pebble at a state q (i.e. the configuration associated with 
q), then there is a later state q' that queries some parent of u and no state between (inclusive) q and q' that 
queries u. 

Let D be the set of inputs I such that for every non-leaf node u, if v\, . . . , Vd are the children of u then 
fl(a) = 1 except possibly when a = (v{, . . . ,vf.}. So \D\ = k n . Let R be the states that are critical for at 
least one input in D, and for each r £ R let D r be the inputs in D with critical state r. The remainder of 
the proof of Theorem Q] is devoted to the proof of the next lemma. 

Lemma 2. \D r \ < k n ~ p for every r £ R 

Let us first see that the theorem follows from the lemma. Since {D r } re R is a partition of D, by the 
lemma there must be at least \D\/k n ~ p — k p sets in the partition, i.e. the set of critical states R has size at 
least which is what we wanted to show. 

Consider a very simple cooperative two player game where Player 1 chooses r £ R and an input / in 
D r and gives r to Player 2. Both players know the branching program B. Player 2's goal is to determine / 
(which is the only thing Player 1 knows that Player 2 does not), which by the definition of D is equivalent 
to determining the node values of I. Player 1 gets to send an advice strings to Player 2, and it is her goal 
to minimize the length of the advice strings. The lemma says that, for any critical state r chosen by Player 
1, advice strings in [k] n ~ p suffice to enable Player 2 to determine the input in D r chosen by Player 1. We 
refer to the individual elements from [k] of an advice string as words. 

Fix r in R. Let I £ D r be an input chosen by Player 1, unknown to Player 2. Player 2 will use the 
advice, together with r and the thrifty property of B, to follow the computation path taken by / from r till 
7's output state. We will define the advice string so that each word tells Player 2 the value of a different 
node; when Player 2 learns (the value of) a node in this way, we say he receives the value of (J on) that 
node. There will be at least p nodes -specifically, the bottleneck nodes of I- that Player 2 will not receive 
the values of, but by using the thrifty property he will learn them nonetheless; when Player 2 learns a node 
in this way, we say he deduces the value of that node. 

Let q be the state Player 2 is currently on, initially q = r. Let u be the node queried by q. Suppose u is 
an internal node, and let / u (ai, . . . , ad) be the variable queried by q and V\,...,Vd the children of u. Since 
B is thrifty, ai, . . . , o<j are the values of I on V\, . . . , Vd- Hence, for each «j, if Player 2 does not yet know 
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I(i>i) (meaning, he did not in some earlier state receive or deduce the value of Vi) then he deduces I{vi) = a; 
now. Next, Player 2 needs to decide what edge out of q to follow (Player 2 does this step for all nodes u, 
including leaf nodes). If for some a he learned I(u) = a at an earlier state, then he again takes the edge 
labeled a. Otherwise, we define the next unused word in the advice string to be I(u), and Player 2 uses that 
word now. 

It is clear that for some m < n, the protocol just defined will allow Player 2 to reach the output state 
of / and learn at least m node values along the way, using at most m words of advice. We will argue for a 
stronger proposition: for some m < n — p, a string of m words suffices to allow Player 2 to reach the output 
state and learn at least m+p nodes along the way. That will finish the proof of the lemma, since then we can 
use the remaining (n — p) — m words of the advice string for / for the values of the < n — (m + p) remaining 
nodes (ordered by some globally-fixed order on the nodes of G) that Player 2 has not yet learned. Now, by 
Fact[IJ f° r every bottleneck node u of /, some parent of u is queried at some state on the path from r to the 
output state of /. Furthermore, if u is ever queried on the path from r till the output state, then this must 
happen after some parent of u is queried. Hence, for every bottleneck node u, Player 2 will be able to deduce 
u before he is forced to use a word of the advice to receive the value of u. Since the nodes whose values are 
deduced by Player 2 are disjoint from the nodes whose values are received by Player 2, and Player 2 uses m 
words by assumption, in total Player 2 learns at least m + p node values^. □ 

4.2 Lower bound for Thrifty BPs using easy inputs 

For much of the proof of Theorem [TJ we only considered the behavior of the thrifty BP on inputs from 
the following set: 

Definition 7 (hard inputs for thrifty programs). For given dag G and k > 2, let Dc,k be the set of DEc(fc) 
inputs / such that for every non-leaf node u, if v±, . . . , v<i are the children of u then /„(a) = 1 except possibly 
when a = (y[ , . . . , v[). 

The sets Da,k are a small fraction of the DEc(fc) inputs, and it is not hard to see that separating the YES 
and NO instances of Da,k is easy for unrestricted BPs. The next result shows that the bound of Theorem 
[T] holds even for thrifty BPs that are only required to be correct on inputs from Dc,k- 

Theorem 2. If G has pebbling cost p then for any k>2 if B is a thrifty deterministic BP that computes a 
set consistent with DEQ^k) for the inputs Dq then B has at least k p states. 

Proof. In the proof of Theorem [TJ the only place we used the correctness of B on inputs outside of Da,k is 
in the proof of Lemma U We now proof it under the weaker assumptions. 

Lemma 3. For any input I £ Dq ^ and non-leaf node u, there is at least one state q on the computation 
path of I that queries u, and for every such q, for each child v of u there is a state on the computation path 
of I before q that queries v. 

Fix / 6 Dc.k- Once again we prove the lemma for / starting with the root, and then the children of the 
root, and so on. Let v%, . . . ,Vd be the children of u rc , t- I must visit at least one state that queries its thrifty 
Wroot variable, since otherwise B would make a mistake on an input J £ Dq^ that is identical to / except 



Now let u be any non-leaf node and suppose there is some state on the computation path of / that queries 
u for which the lemma does not hold. Let q be the earliest such state. So for some child v of u there is no 
state before q that queries v. For any n/o' there is an input I a £ Dc.k that is identical to / except v Ia = a 
and fu" (v{, . . . , Vj) = 1. Suppose there is no state q' before q on the computation path of / that queries u. 

10 We do not say exactly m + p because technically, according to the given protocol, if the bottleneck configuration assigned 
to input / has p' > p pebbles (which can happen), then Player 2 learns more than m + p node values. 
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Then / and I a have the same computation path up to q and so I a visits q also, which violates the thrifty 
assumption. Hence there is a state q' before q on the computation path of I that queries u. By our choice of 
q, the lemma must hold for q' . This is a contradiction since the states given by the conclusion of the lemma 
for q' satisfy the conclusion of the lemma for q as well. □ 

4.3 Lower Bound for Semantic-incremental BPs 

Theorem 3. If there is a rooted DAG G with n > 2 nodes, indegree 2 and pebbling cost p, then for any 
k > 2 and m = 'ikn + n + 1 every deterministic semantic incremental BP solving m-GEN has at least k p 
states. 

Proof. Let G,n,p, k,m be as in the statement of the theorem; these are fixed throughout the proof. The 
bulk of this argument is a reduction from DEcj(fc) to m-GEN; we map each instance I of DEg(fc) to an 
instance T 1 of m-GEN such that I is a YES instance iff T 1 is. Let E be the set of inputs for DEc(fc). We 
will show that if there is a semantic-incremental m-way BP of size s that computes m-GEN correctly on 
the inputs {T i }i^e, then there is a thrifty k-w&y BP of size at most s that computes DEc(fc). Then from 
Theorem Q] we get s > k p . 

Fix an order on the nodes of G. We will not differentiate between a node u and its index in [n] given by 
this order. Let u root £ [n] be the (index of) the root of G. We divide the elements {n + 2, . . . , m} into two 
parts, one of size nk and the other of size < 2nfc03 and refer to them by the following mnemonic names: 

• (u, a) for each node u and a £ [k]. T 1 generates this element iff u 1 = a. 

• (vu,a) for each arc vu and a £ [k]. T 1 generates this element iff v 1 = aP^I 

Any way of assigning those < 3nfc names to distinct elements of {n + 2, . . . , m} will suffice, except we require 
that (iiroot, 1) gets assigned to m, since then we will have that T 1 is a YES instance of m-GEN iff / is a 
YES instance of DEc(fc). Elements {1, . . . , n + 1} will be generated by every T 1 ; their purpose is technical. 
Now we give the reduction. Fix an instance / of DEo(k). First we make T 1 generate each of the elements 
{2, . . . , n + 1}. For each m£ [n]: 

T J (l,u) :=u+l (1) 

In a similar way, we make T 1 generate the elements (w, Z^) for each leaf w. Fix an order w\, . . . , wi on the 
leaf nodes. For each t 6 [I — 1] and a E [k]: 

T J (l,n+l):= 
^(1,(^,0)):= (uW^) 

For every non-leaf node u £ [n] and a, 6i,&2 £ [k], if v\ and V2 are the left and right children of u then 
we add the following definitions. Equations <j3j) simply propagate the value of a node to its out-arcs. Let 
b = fu{b\, 62). Equation (j4]) expresses: If / gives the left in-arc of u value 61 and the right in-arc of u value 
&2, then / gives u the value b = fl(bi, 62)- 

T z (u + 1, (vi,a)) := (viu,a) 
T ! (u + 1, (v 2 , a)) := (v 2 u, a) 

T I ({v 1 u,b 1 ) 1 {v 2 u 1 b 2 )):={u 1 b) (4) 

Let us call a variable (x,y) used if T I (x,y) is defined at this point, and unused otherwise. Examining the 
left sides of equations ©-(H]), it is clear that the set of used variables depends only on G and k (not on J). 
For every unused variable (x, y) define 

T T (x,y)~l (5) 

1 2n is a bound on the number of edges since non-leaf nodes have indegree at most 2. 

12 These elements may seem redundant, given the elements (u, a). However, that is correct only if G has the property that 
no two nodes U1.U2 have the same children. 
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That completes the definition of T 1 . It is straightforward to show that J is a YES instance of DEc(fc) iff T 1 
is a YES instance of m-GEN. 

Now we show how to convert, without increasing the size, a semantic-incremental m-way BP B that 
computes m-GEN correctly on the inputs {T I }j e E, into a thrifty fc-way BP that computes DEc(fc)- From 
the above definition of the inputs {T i }]^e, an m-GEN variable (x, y) is a used variable iff it has exactly one 
of the following four forms, where [T] corresponds to Equation (fTJ, and HT1 corresponds to Equation @, etc: 

i. (1, it) for some u G [n] 

ii. (1, (wt, a)) for some leaf node wt and a G [k] 

iii. (u + 1, (v, 6)) for some b G [k] and some non-leaf node u G [n] with child v 

iv. ((viU, bi) , (v 2 u, b 2 )) for some non-leaf node it with children vi, v 2 and 61, 62 € [&] 

We say (x,y) is a type |T] variable iff it has the form of |T] above, and type HT1 llxl , t [jv] variables are defined 
analogously. Only variables of type HP and liv.l will be translated to DEc(k) variables. The remaining used 
variables, of types HI and liii.i are "dummy" variables, in the sense that the right sides of the corresponding 
defining equations (TTJ) and ^ do not depend on /. So for any state q in B that is labeled with a dummy 
variable, there is at most one edge out of q that any input T 1 can take; as a consequence these states will 
eventually be deleted. For the same reason, states labeled with unused variables will also be deleted. 

Recall the ordering on the leaf nodes wi, . . . , wi that we fixed earlier. Let q be a state of B that queries 
a variable (x,y). If (x, y) is an unused variable then delete all edges out of q except the one labeled 1. 
Otherwise (x,y) is one of the variable types lIT-nvl and should be handled as follows: 

i. (x,y) = (1, u) for some u G [n]. Delete every edge out of q except the one labeled u + 1. 

ii. If (x, y) = 1), then delete every edge out of q except for the k edges labeled (wi,a) for a G [k]. 
Otherwise (x,y) = (1, (wt,a)) for some t G [I] and a G [k]. Delete every edge out of q except for the k 
edges labeled (wt+i,b) for b G [k]. 

iii. (x, y) = (it + 1, (v, b)) for some b G [k] and nodes u, v such that v is a child of u. Delete every edge out 
of q except the one edge labeled (vu, b). 

iv. (x,y) = ((v\u,bi) , (V2U, 62)) for some non-leaf node u with children vi,V2 and 61,62 G [k]. Delete every 
edge out of q except the k edges labeled (it, a) for a G [k]. 

At this point, a non-output state q has outdegree zero or one iff it is labeled with an unused or dummy 
variable. For any non-output state q with outdegree zero, change q to a reject state. Now consider q with 
outdegree one. Note its out-edge must be labeled with an element of the same form as the right side of one 
of Equations |T|), (J3j) or (|5|). Let q' be the unique state that q transitions to. For every state q" with an 
out-edge e to q, move the target node of e from q to q'. After all the edges into q have been moved, delete 
q. The last step is to rename the variable labels on the remaining states. Note that only types HT] and liv.l 
remain. Rename them as follows: 

(l,7t + 1) ^ l Wl (1, (w t , a)) H> l Wt+1 

((viu, 61) , (v 2 u, b 2 )) i-> f u (bi,b 2 ) 

Let B' be the resulting branching program. Since we have obtained B' only by renaming variables and 
deleting edges and states, clearly its size is no greater than that of B. We need that B accepts T 1 iff B' 
accepts / for every I G E. It is clear that "bypassing" and then deleting the states labeled by unused and 
dummy variables, in the way done above, has no effect on any input T 1 . Also, one can check that for every 
edge e we removed before deleting the dummy states, none of the inputs T 1 can take edge e. Finally, B' is 
thrifty precisely because B is semantic-incremental. □ 

Corollary 1. There is a constant c > such that for infinitely-many m every deterministic semantic- 
incremental BP solving m-GEN has at least 2 cm / lo s m states. 

Proof. We will need a family of rooted DAGs that are much harder to pebble than the complete binary trees. 
JPTC76] provides such a family: 
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There is a family of rooted DAGs {Gt}t>\ with Q(t) nodes and indegree two whose black pebbling 
cost is Q(t/ logi). 

Let n t be the number of nodes in and let p(n t ) be the pebbling cost of Gt as a function of n t . So 

p{nt) = Sl(nt/ log nt). We use Theorem[3]on each of the Gt with k = 2. For mt := 3kn t + n t + 1 = 7n t + 1 < 
8n t , for every t we get lower bounds of 2 p (™') > 2 p ( mt / 8 ) for semantic incremental BPs solving m t -GEN. 
This suffices since p(m t /8) > cmt/ log m t for some constant c > and all t. □ 



5 Open problem 

Nondeterministic thrifty BPs solving BT h (k) were defined and studied jBCM+09] and jWehlOj . The 
definition of nondeterministic thrifty BPs solving DEc{k) is the natural extension of that definition from 
binary trees to arbitrary dags. [GKM08 defined and studied a similar model, nondeterministic semantic 
incremental BPs solving GEN. The open problem is to prove superpolynomial lower bounds for one or both 
models. An interested reader should consult jBCM+09] and |GKM08j for upper bound results and more. 
It should be possible to adapt the main construction in the proof of Theorem [1] (converting deterministic 
semantic incremental BPs into deterministic thrifty BPs) to show that lower bounds for nondeterministic 
semantic incremental BPs follow from lower bounds for nondeterministic thrifty BPs 
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nt > 2 for all t > 1, but regardless we could always just start at some to > 1 to ensure that, without changing the result. 
14 But an easy proof of the converse appears unlikely, due to a subtlety in the definition of nondeterministic semantic incre- 
mental BPs. An interested reader should contact the author of this paper. 
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